WHAT IS CLAIMED IS: 



1 1 . A method of performing speech recognition across a network 

2 comprising: 

3 downloading first recognition information from a remote server to a first 

4 computer to recognize a first plurality of words; 

5 programming the first computer with the first recognition information to 

6 recognize the first plurality of words; 

7 receiving at least one of the first plurality of words in the first computer; 

8 generating first recognition results in response to receiving said at least one of 

9 the first plurality of words; 

10 downloading second recognition information from the remote server to the 

1 1 first computer to recognize a second plurality of words, wherein the second recognition 

12 information is selected based on the first recognition results; and 

13 programming the first computer with the second recognition information to 

14 recognize a second plurality of words. 

1 2 . The method of claim 1 wherein the first computer is connected to the 

2 server over an internet. 

1 3. The method of claim 2 wherein the first and second recognition 

2 information is downloaded from a internet web site. 

1 4. The method of claim 1 wherein the first computer is connected to the 

2 server over an intranet. 

1 5. The method of claim 4 wherein the first and second recognition 

2 information is downloaded from a intranet web site. 

1 6. The method of claim 1 wherein the first computer is connected to the 

2 server over a local network. 

1 7. The method of claim 1 wherein the first computer includes a software 

2 recognition engine. 

1 8. The method of claim 7 wherein the software recognition engine runs in 

2 a general purpose microprocessor. 



12 



1 9. The method of claim 1 wherein recognition is performed using 

2 speaker-independent speech recognition. 

1 10. The method of claim 9 wherein the first and second recognition 

2 information comprise neural network weights. 

1 1 1 . A method of performing speech recognition across a network 

2 comprising: 

3 providing, from a server to a first computer, sets of data to recognize spoken 

4 utterances from corresponding limited sets of candidate utterances; and 

5 supplying different sets of said data from the server to the first computer to 

6 recognize different spoken utterances from corresponding limited sets of candidate utterances 

7 at different times in response to different user interactions. 

1 12 . The method of claim 1 1 wherein the first computer is connected to the 

2 server over an internet. 

1 13. The method of claim 12 wherein the first and second information is 

2 downloaded from a internet web site. 

1 14. The method of claim 1 1 wherein the first computer is connected to the 

2 server over an intranet. 

1 15. The method of claim 14 wherein the first and second information is 

2 downloaded from a intranet web site. 

1 16. The method of claim 1 1 wherein the first computer is connected to the 

2 server over a local network. 

1 17. The method of claim 1 1 wherein the first computer includes a software 

2 recognition engine. 

1 18. The method of claim 1 7 wherein the software recognition engine runs 

2 in a general purpose microprocessor. 

1 19. The method of claim 1 1 wherein recognition is performed using 

2 speaker-independent speech recognition. 
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1 20. The method of claim 19 wherein the first and second information 

2 comprise neural network weights. 

1 21 . The method of claim 1 1 further comprising prompting a user to input a 

2 first spoken utterance corresponding to a first limited set of candidate utterances and 

3 prompting a user to input a second spoken utterance corresponding to a second limited set of 

4 candidate utterances. 

1 22. The method of claim 1 1 wherein said data further includes synthesis 

2 data. 

1 23. The method of claim 1 1 wherein said data further includes video data. 

1 24. The method of claim 1 1 wherein recognition is performed using 

2 speaker-independent speech recognition. 

1 25. The method of claim 24 wherein the first and second information 

2 comprise neural network weights. 

1 26. A system for performing speech recognition across a network 

2 comprising: 

3 a server including information to recognize a plurality of spoken utterances; 

4 and 

5 a first computer including a recognition engine, wherein the first computer is 

6 coupled to the server by said network, 

7 wherein the server supplies different sets of information to the first computer 

8 to recognize different spoken utterances from corresponding limited sets of candidate 

9 utterances at different times in response to different user interactions. 

1 27. The system of claim 26 wherein the server supplies the first computer 

2 with first information for recognizing a spoken utterance from a first limited set of candidate 

3 utterances, and the first computer is programmed with the first information to recognize the 

4 first limited set of candidate utterances. 
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1 28. The system of claim 27 wherein the first computer generates first 

2 recognition results in response to receiving a spoken utterance from the first limited set of 

3 candidate utterances. 

1 29. The system of claim 28 wherein the server supplies the first computer 

2 with second information for recognizing a spoken utterance from a second limited set of 

3 candidate utterances, wherein the second information is selected based on the first 

4 recognition results, and the first computer is programmed with the second information to 

5 recognize the second limited set of candidate utterances. 

1 30. The system of claim 26 wherein the first computer is connected to the 

2 server over an internet. 

1 31. The system of claim 26 wherein the first computer is connected to the 

2 server over an intranet. 

1 32. The system of claim 26 wherein the first computer is connected to the 

2 server over a local network. 

1 33. The system of claim 26 wherein the recognition engine comprises 

2 software running in a general purpose microprocessor. 

1 34. The system of claim 26 wherein recognition is performed using 

2 speaker-independent speech recognition. 

1 35. The system of claim 34 wherein the recognition information comprises 

2 neural network weights. 
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